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Abstract: For the balanced ANOVA setup, we propose a new closed form 
Bayes factor without integral representation, which is however based on 
fully Bayes method, with reasonable model selection consistency for two 
asymptotic situations (either number of levels of the factor or number of 
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marginal density under a special choice of the priors enables such a Bayes 
factor. 

AMS 2000 subject classifications: Primary 62F15, 62F07; secondary 
62A10. 

Keywords and plirases: Bayesian model selection, model selection con- 
sistency, fully Bayes method, Bayes factor. 



1. Introduction 

We start with a simple one-way balanced ANalysis-Of-VAriance (ANOVA). 
There are two possible models. In one model, all random variables have the same 
mean. In the other model, random variables in each level has a different mean. 
Formally, the independent observations j/y (i = 1, . . . ,p, j = 1, . . . , r, n = pr) 
are assumed to arise from the linear model: 

Ui] = M + "i + e-y- e-y ^ ^(0, o'^) (1-1) 

where ai (i = 1, . . . ,p) and cr^ are unknown. We assume ^ = as unique- 
ness constraint. Clearly two models are written as follows: 

Ml : a = (ai, . . . ,ap)' = 
vs Ma+1 ■■ ae {ae 7^P|a ^ 0, a'lp = 0}. 

In (|1.2p , A means the name of the factor and the subscript ^ -f 1 is from the 
fact that E[yij] in (jl.ip consists of the sum of the constant term and the level 
of the factor. 

In this paper, we will consider Bayesian model selection based on Bayes factor 
for ANOVA problem. Model comparison, which refers to using the data in order 
to decide on the plausibility of two or more competing models, is a common 
problem in modern statistical science. In the Bayesian framework, the approach 
for model selection and hypothesis testing is essentially same, whereas there 
is a big difference in classical frequentist procedures for model selection and 
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hypothesis testing. A natural approach is to use Bayes factor (ratio of marginal 
densities of two models ), which is based on the posterior model probabilities 
( Kass fc Raftervl (|l995l) i That is the reason why we take Bayesian approach 



based on Bayes factor in this paper. 

One of the most important topic on Bayesian model selection is consistency. 
Consistency means that the true model will be chosen if enough data are ob- 
served, assuming that o ne of the competing models is true. It is well-known that 
BIC by [Schward (|l978h has consistency in classical (so called "n > p") situa- 



tion. As a variant of "p > n" problem, which is hot in modern statistics, the 
consistency in the case where j p ^ o o an d r is fixed in one-w ay ANOVA setup, 
has been considered bvlSto ii3 (|l979f l and lBerger et all (j2003l ). In the following, 



"CASE I" and "CASE 11" denote the cases where 

I. r goes to infinity and p is fixed, 
II. p goes to infinity and r is fixed, 

respectively. Under known a and CASE II, IStond (|l979h showed that BIC al- 



ways chooses the null model Mi (that is, BIC is not consistent under Ma+i) 
even if a.'ot/{pa'^} is sufficiently large. This is reasonable because BIC is orig- 
inally deri ved by the Laplace approximation under classical situation. Under 
known tr^, iBerger et al.l ( 20031) proposed the Bayesian criterion called GBIC, 



which is derived by the Laplace approximation under CASE II. Then they 
showed that GBIC has model selection consistency under CASE II. 

Generally, the original representation of Bayes factors or marginal densities 
involve integral. In the normal linear model setup, even if conjugate prior is 
used, hyperparameter and its prior distribution are usually introduced in order 
to guarantee objectivity, which is called fully Bayes method. (On the other hand, 
in empirical Bayes method, maximization of the conditional marginal density 
given hyperparameter with respect to hyperparameter is applied.) Since find- 
ing a prior of hyperparameter, which enables analytical calculation completely, 
is considered as extremely hard, the Laplace approximation has been applied. 
Needless to say, the Laplace approximation needs some assumptions, in particu- 
lar, on "what goes to infinity" . However, when both p and r are large (or small) 
in analysis of real data, the answer to the question which type of the Laplace ap- 
proximation is more appropriate, is obscure. Moreover an approximated Bayes 
factor under one assumption does not necessarily have consistency on the other 
assumption, which is not good for practitioners. Therefore Bayes factor 

1. without integral representation, which is however based on fully Bayes 
method, 

2. with model selection consistency for two asymptotic situations, CASE I 
and II 

is desirable, which we will propose in this paper. Actually, a special choice of 
the prior of hyperparameter, which completely enables analytical calculation of 
the marginal density, is the key in the paper. 
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Eventually the Bayes factor which we recommend is given by 
BF \M ... i_ W2)r(p(.-l)/2) 

where 

We _ Y^rMj-m-f _ _ Ej _ _ E»j y^j 

Wt Y.zj(.yij - y -y r ' ^" pr 

It is not only exactly proportional to the posterior probability of M.a+i, but also 
a function of We/Wt, which is fundamental aggregated information of one-way 
ANOVA, from the frequentist viewpoint. 

The rest of the paper is organized as follows. In Section[2l we review methods 
for Bayesian model selection based on Bayes factor, give a concrete form of the 
prior we use for one-way balanced ANOVA and eventually propose the Bayes 
factor given by ()1.3|) . In Section[3l we show that the Bayes factor has a reasonable 
model selection consistency. In Section SI we extend results in Section [2] and [3] 
to two-way balanced ANOVA setup. 



2. A Bayes factor for one-way ANOVA 



In this section, we will propose the Bayes factor for one-way ANOVA, which is 
given by (II. 3|) . In Sub-section 12. li we first re-parameterize the ANOVA model 
given by p.ip and then give priors for it. In Sub-section [221 we derive marginal 
densities under A4i and A4a+i and eventually the Bayes factor using them. 



2.1. re-parameterized ANOVA and priors 

Let 

y = {yii, ■ ■ ■,yir,y2i, ■ ■ ■,y2r, ■ ■ ■,ypi, ■ ■ ■,ypr)' and x = ip® i^. 

Then the linear model given by is written as 

y = /.tin + X.a. + e. 

Further the centered matrix of X, X — p^^l„lp, is decomposed by the singular 
value decomposition as 

y/rUW' = Vr{ui, . . . . . .,Wp_iy 

where U and W are n x [p— 1) and px (p— 1) orthogonal matrices, respectively. 
Let 9 = ^/rW'o^ G TiP^^ . Then the one-way ANOVA is re-parameterized as the 
linear regression model 

V = /il„ + Ue + e (2.1) 
and two models are written as Aii: 6 — Op_i and A4a+i- 9 G TZ^^^ \ {Op_i}. 
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In model selection in the Bayesian framework, the specification of priors 
are needed on the models and parameters in each model. For the former, let 
Pr(A^i) = Pt{Ma+i) = 1/2 as usual. For the latter, at moment, we just write 
joint prior densities as p{n,a'^) for M\ and p{fj.,6,a^) for Ma+i- From the 
Bayes theorem, Ma+i is chosen when Pr{MA+i\y) > 1/2 where 

PT{MA+i\y) - ^ + ^ 



1 + BF{Ma+i;Mi) 

and BF{A4a+i', -Ml) is the Bayes factor given by 

BF{MA+i;Mi)=mA+i{y)/mi{y). (2.2) 

In other words, A4a+i is chosen if and only if BF(A^^+i; A^i) > 1. In 
mj(y) is the marginal density under M-y for 7 = 1,^ + 1 as follows: 



miiy) = JJ )p{p-,(T )dfida 

mA+iiy) ^ p{y\n,e,a'^)p{fi,9,a'^)dfid9da'^, 



where cr^) and p{y\^,0,a^) are sampling densities of y under Mi and 

Ma+1, respectively. 

In this paper, we use the following (improper) joint density 

pi^,,c7^)=pi^i)pia^)^lxa-^ (2.3) 

for Ml and 

pifi, e, a') = pUi)p{a^)p{6\a^) = — / p{6\g, a^)p{g)dg (2.4) 

for Ma+1- Note that p{^)p{a^) — a^^ in both (|2.3p and (|2.4[) is a popular non- 
informative prior. It is improper, but justified because /i and are included in 
both Ml and Ma+i- The proper densities p{9\g, ct^) and p{g) will be specified 
in the next subsection. Recall that we will consider fully Bayes method, which 
means the joint prior densities p(/i, u^) and p(/i, 6, a^) do not depend on the ob- 
ser vations y. On the other h and, the prior densities for empirical Bayes method 



do ( George k Fosteij (200] 



2.2. Marginal densities and the Bayes factor 

First we derive the marginal density under M 1 . Using the Pythagorean relation 

\\y - fjilnf = n{y.. - ^if + Wt, 
where y.. = J^ij Vv and Wt = \\y ~ y-ln|P = J^ijiVv ~ V- f^ we have 

roo 1 / ||y_^l„||2^ 



.^L (^^O^VV^^^PV 2a 



dfida 
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.1/2 



(27r)"/2-i/2 7p , 

ni/^r({n- l}/2) 
7r("-i)/2 



yn+l 



exp 



2^ 



'(n-l)/2 



Notice that Wt is called "total sum of squares" in the ANOVA context. The 
total sum of squares, Wt, is identically partitioned as the sum of "within group 
sum of squares" We and "between group sum of squares" Wh as follows: 



WT^\\v-yf + \\y-v..inf 



y-f 



where = r ^ Y.j Vij for i = 1, . . . ,p and y = (yi., . . . y^.)' (g) 1^. 

Then we derive the marginal density under A4a+i- As explained in the pre- 
vious subsection, (|2.4p is t he form of the joint density we use in this paper. As 
p{9\g, a^), we use so-called Zellneil ( 1986[) 's g-prior 

p{e\a^g) = iVp_i(0,ga2([/'i7)-i) = 7Vp_i (0, ga^ Jp_i). 

There are many p a pers which use q-prior s inclu ding iGeorge fc Foster 

Liang et al. ( 2008h . Maruvama fc Georgel (12008^ in Bayesian model selection 



co ntext. In the S t ein es timation conte xt, this type of shrinkage p riors is known 
StrawdermanI (jl97lh 's prior. See iMaruyama fc StrawdermanI (|2005» ) for the 



as 



detail. 

Using the relationship 



\y-^iln-uef + g-'\\ef 

= niy.. -f,f + \\y- y..l„ - C/0||2 + g-^\\e\' 



n{y.. - fif + 



1 



9 



e 



5+1 



-I- 



Wt + gWE 



we have the conditional marginal density of y given g under AAa+i 



mA+iiy\g) 



-oo Jo 



p{y\^, e, a^)p{e\a^,g)p{a^)dfiddda^ 
1 1 



\\y-f,in-uer \\e\\^ 



X exp , 2,2 

°° nV2(i+^)-(p-i)/2 
(27ra2)(n-i)/2 

(l+g)(n-rt/2 



exp 



25^2 
Wt + gWE 



p{a^)d^ideda^ 



2a^{g + l) J a 



-^da^ 



{giWE/Wr} + 1) 



(«-l)/2- 
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The (fully) marginal density is given by 

/•oo 

ruA+iiy) = / mA+i{y\g)p{g)dg 
Jo 

which is usually calculated by numerical methods like MCMC or by approxima- 
tion like Laplace method. But, in this paper, very nice analytical results will be 
derived by choosing a following special prior of g. 

As the prior of g, we use Pearson Type VI or beta-prime distribution 

p{g) = {B{a +1,6+ 1)}- V(l + g)-"-'-', (2.5) 

which is clearly proper if a > — 1 and b > —1. In particular, when b = (n — 
p)/2 ~ a — 2, we get a closed simple form of the marginal density 

/•°° (i + o)("-p)/2 
mA+i{y) = mi(y) / p{g)dg 

Jo {g{WE/WT} + ir " 
, , r(p/2 + fl+l/2)r((n-p)/2) 
-"'^^^^ r(a + l)r({n-l}/2) \Wt) 

If 6 7^ (n— p)/2 — a— 2, there remain s an integral i ncludi ng We/Wt on m^+i(y). 
Actually "hyper-5 priors" given bv iLiang et al.l ( 20081 ) corresponds to the case 
6 = 0. They had no other choice to use the Laplace approximation. 

For the choice of a, my recommendation is a = ~l/2. We will describe it 
briefly. The asymptotic behavior of p{g) given by (|2.5p . for sufficiently large g, 
is proportional to g~°'~^. From the Tauberian Theorem, which is well-known for 
describing the asymptotic behavior of the Laplace transform, we have 

/•oo 

p{e\a')= p{eW',g)p{g)dg^{a'r+'\\e\\-^P+'^+'\ (2.6) 
Jo 

for sufficiently large 6 e TZ^^^, a > —1 and 6 > — 1. Hence the asymptotic tail 
behavior of p{6\(T^) for a = —1/2 is mult i variat e Cauchy, which 
has been recommended bv lZellner fc Slow and others in objective Bayes 

context. 

Finally the Bayes factor which we recommend is written as 



RP \U • A/f 1 - "^A+ijy) 
mi[y) 

^ r(p/2)r(p(r-l)/2) (Wi 



r(l/2)r({pr-l}/2) \ Wt 



-p(r-l)/2+l/2 



(2.7) 



where the subscript FB means "Fully Bayes" . It is not only exactly proportional 
to the posterior probability Pr(A^A+i|y), but also a function of Wt and We 
only through We/Wt, which is fundamental aggregated information of one-way 
ANOVA, from the frequentist viewpoint. 
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Remark 2.1. The most advantage of BFi?B over existing Bayes factors is its ex- 
cellent closed form. Ma ny Bayes factors bas ed on fully Bayes method including 
intrinsic Bayes factor bv lCasella et al. ( 2009t ) have the closed forms. Since it usu- 



ally involves the integral in the represe ntation, they h ave to app ly the Laplace 
appro ximation in practice. For example, Casella et al.l ( 20091 ) and Moreno et al.l 



( 2009( ) applied different types of the Laplace approximation to the same Bayes 



factor with integral representation. However, the answer to the question which 
type of the Laplace approximation is more appropriate, is obscure for some cases 
(for example, in the case where both p and r are large (or small) in ANOVA 
problem). On the other hand, ^¥ fb does not require thought and has a reason- 
able model selection consistency for two cases (r — > cx), fixed p) and {p ^ oo, 
fixed r), as seen in the next section. 



3. Model selection consistency 

In this section, we consider the model selection consistency in the case where 
n = pr approaches the infinity. In concrete, we consider two cases (r — > oo, fixed 
p) and (p — *■ oo, fixed r). Generally, the posterior consistency for model choice 
is defined as 

plimPr(7W.^|y) = 1 (3.1) 

n — ^oo 

or equivalently 

plim BF[M^;My'] = oo 

n — >oc 

when is the true model and A4^' is not. Here plim denotes convergence in 
probability and the probability distribution in (j3.1D is the sampling distribution 
under the true model M-y. We will show that BFfb[Ma+i',Mi] given by (|2.7p 
has a reasonable model selection consistency. As the competitor of BFi?^, the 
Bayes factor based on BIG 

B¥bic[Ma+i;Mi] = f (pr)-^?'-!)/^ 
will be considered. 

Theorem 3.1. 1. Assume r ^ oo and p is fixed. 

(a) BFbic ^.''^d BFpB are consistent whichever the true model is. 
2. Assume p ^ oo and r is fixed. Also let ca = hmp_,oo X^iLi '^f/IP"'^}- 

(a) BFbic o,nd BFpB are consistent under Mi. 

(b) BFpB is consistent [inconsistent] under Ma+i when ca > [<]^('") 
where 



/i(r) = r^/f'-^) - 1. 



(3.2) 
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(c) BFbic inconsistent under AAa+i for any > 0. 

Note: h{r) is clearly a convex decreasing function in r which satisfies h{2) 
1, /i(5) = 0.5, /i(10) =^ 0.29, and h(oo) = 0. 

P 

Proof. In the proof, denotes convergence in probability. Note 



We We/cj^ X; 



2 

pr—p 



(3.3) 



Hence we have 

We p i + xl-i/pr) ^ under A^i 

Wt^" \ {l + under Ma+i 

when r — > oo and p is fixed and 

We l/r) under TWi 

Wt ^ \(1 - l/r)/(l + CA) under A^A+i 

when p — > cxD, r is fixed and c^i = linip^oo J2i=i '^i/{P'^^} is assumed. 
For the asymptotic behavior of the gamma function, Stiring's formula, 

T{ax + b)^ V2^e-"^(ax)"^+''-i/2 (3 5) 

for sufficiently large x is useful. Here f ~ g means \imf/g = 1. Using (|3.5p . we 
get 

r{{pr~p}/2) ^ 



(3.4) 



T{{pr - l}/2) 
when r 00 and p is fixed, and 

r{p/2)r{{pr - p}/2) V2^r 



{pr/2)'^P-'>^\ (3.6) 



r({pr-l}/2) (r_ 1)1/2 |^^r/(r-l) 



p(r-l)/2 



(3.7) 



when p — > 00 and r is fixed. 

First, we consider the consistency in the case where r 00 and p is fixed. 
Using (|3.3p and (|3.6p . we have 



r,. P /ci(p)r-(P-i)/2exp(x^_i/2) under Mi 

'''^^^f-^-'+^'-^^^^\ci(p)r-(P-i)/^(l + Ea?/W}r under M^+i, 

which goes to zero under A4i and infinity under A4a+i respectively, and 
P /c2(p)r-(P-i)/2exp(x^_i/2) under TWi 



BFbic[Ma+i;Mi] 



C2(p)r-(P-1)/2(1 +^a2/|p^2|)pr ^j^^^^, Ma+u 
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which goes to zero under A4i and infinity under Ma+i respectively. In the 
above, ci(p) and C2{p) are given by 



Cl 



(p) = (p/2)-(P-i'/2p(^/2)/r(l/2) and cab) = p-^P-'^' 



Thus part[Ta|of the theorem foUows. 

Then we consider the consistency in the case where p — > oo and r is fixed. 
Using (|3.4p and (|3.7p . we have, 



BFi.B[A^A+i;A<i] 



P [ V2r(r - l)-i/2j,-p/2 undcj. _yv4i 



which goes to zero under Mi and under Ma+i with ca < h{r). It goes to 
infinity under Ma+i with ca > h{r). On the other hand, we have 

^i/2p-(p-i)/2 f ^^Y^^ under Mi 

p/2 



i/2p-(p-i)/2 + caY) under X 



which goes to zero in both models. Thus parts Hal [2b] and [^c] of the theorem 
follow. □ 

Remark 3.1. We give some remarks on inconsistency shown in the theorem. 

1. As shown in [2c] of Theorem l3.1|, BIG always chooses Mi when p — > oo and 
r is fixed, even if ca is very large. This is interpreted as unknown variance 
version of Stone (Il979l) 's example. 

2. As seen in [2b] of Theorem 13. 1[ BFpB has an inconsistency region. Ac- 
tually existing such an inconsistency region has been also reported by 
Moreno et alT(|2009l ). They proposed intrinsic Bayes factor for normal re- 



gression model and their upper-bound of inconsistency region for one-way 
ANOVA is given by 

.-1 _^ 



(r + - 1 

which seems to be slightly smaller than h{r) given by (|3.2p . 
3. The existence of inconsistency region for small ca and large p is quite 
reasonable from the following reason. Assume new independent observa- 
tions Zij (i = 1, . . . ,p, J = 1, . . . , r) from the same model as yij. Then the 
difference of scaled mean squared prediction errors of j/i. and y.. is given 

by 



pra^ pra^ 



pa^ pr 
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for any p and r. First assume that A^i is true. We see that 

A[y..;y..] = -(p- 1)/{M < 0, 

which is reasonable. Then assume that A4a+i is true. When r — s- oo and 
p is fixed, 

hm A[y..;y,.] =^^a2/(pa2) >0, 

r — >oo 

which is reasonable. On the other hand, if p —^ oo and r is fixed, 
lim A[y..;yi.] = ca ~ l/r. 

p — >oc 

Hence when ca < 1/?", is negative even if M.a+i is true. There- 

fore, from the prediction point of view, the existence of inconsistency re- 
gion for smal l ca a nd large p is reasonable. 
4. iBerger et al.l (|2003l ) considered Bayesian model selection for one-way ANOVA. 
Under known variance setup, they showed that, for any prior of the form, 

p(a) cx j^fP/^ exp(-tQ;'a/{2cr2})p(t)dt, 

with p{t) having support equal to (0, oo), the Bayes factor is consistent 
under A^i. They also showed that consistency under Ma+i holds if ca > 
0. Hence we see that their result is not extensible to the unknown case 
since our bet a -prim e prior given by (|2.5p has support equal to (0, oo). 
Moreno et al. ( 20091 ) discussed Berger's result from the different point of 



view. 



Table 1 shows frequency of choice of the true model in some cases in numerical 
experiment. We see that it clearly guarantees the validness of Theorem 13. II 



4. two-way balanced ANOVA 

In this section, we extend the results in Section [2] and [3] to two-way balanced 
ANOVA problem. We have n independent normal random variables yij^ {i = 
1, . . . ,p, j = 1, . . . , <?, fc = 1, . . . , r, n = pgr) where 

Uijk = ^i + a^ + Pj + ia(3)ij + e^jk, eijk ~ N[Q, a'^). 

We assume J^i'^i — ^j(^3 ~ ~ = as uniqueness con- 

straint. In the two-way ANOVA, the following five sub-models are important. 

Mi: E[yijk]=i^, Ma+i'- E[y^jk] = h + a^, Mb+i : E[yijk] = + P], 
Ma+b+1 ■■ E[yijk] ^ pL + a.i+ (3j, M(^a+i){b+i) ■ -E'bij/c] = A* + + Pj + ■ 

As the one-way ANOVA is re-parameterized like (|2.1I) . each model among A4a+i, 
A4b+i, Ma+b+i, and ■A4[a+i){b+i) can be also re-parameterized as follows: 

y ^ uln + Ue + e (4.1) 
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Table 1 

Frequency of choice of the true model 





p\r 


2 


5 


10 


50 


100 


2 


5 


10 


50 


100 








under Ail 






CA = 0.1 under Ma+i 




FB 


2 


0.77 


0.89 


0.94 


0.98 


0.98 


0.28 


0.26 


0.30 


0.80 


0.97 




5 


0.93 


0.99 


1.00 


1.00 


1.00 


0.19 


0.09 


0.15 


0.78 


1.00 




10 


0.93 


0.99 


1.00 


1.00 


1.00 


0.08 


0.03 


0.04 


0.79 


1.00 




50 


1.00 


1.00 


1.00 


1.00 


1.00 


0.00 


0.00 


0.00 


0.91 


1.00 




100 


1.00 


1.00 


1.00 


1.00 


1.00 


0.00 


0.00 


0.00 


0.97 


1.00 


BIC 


2 


0.53 


0.79 


0.91 


0.97 


0.97 


0.52 


0.37 


0.39 


0.83 


0.98 




5 


0.75 


0.97 


0.99 


1.00 


1.00 


0.33 


0.10 


0.12 


0.71 


0.99 




10 


0.94 


1.00 


1.00 


1.00 


1.00 


0.07 


0.01 


0.01 


0.50 


0.99 




50 


1.00 


1.00 


1.00 


1.00 


1.00 


0.00 


0.00 


0.00 


0.00 


0.99 




100 


1.00 


1.00 


1.00 


1.00 


1.00 


0.00 


0.00 


0.00 


0.00 


0.99 








CA = 0.5 under Ma+i 






CA = 


1 under Ma+i 




FB 


2 


0.48 


0.66 


0.86 


1.00 


1.00 


0.68 


0.88 


1.00 


1.00 


1.00 




5 


0.39 


0.59 


0.90 


1.00 


1.00 


0.59 


0.93 


1.00 


1.00 


1.00 




10 


0.29 


0.56 


0.95 


1.00 


1.00 


0.59 


0.97 


1.00 


1.00 


1.00 




50 


0.07 


0.55 


1.00 


1.00 


1.00 


0.52 


1.00 


1.00 


1.00 


1.00 




100 


0.03 


0.55 


1.00 


1.00 


1.00 


0.54 


1.00 


1.00 


1.00 


1.00 


BIC 


2 


0.74 


0.79 


0.92 


1.00 


1.00 


0.87 


0.95 


1.00 


1.00 


1.00 




5 


0.57 


0.61 


0.88 


1.00 


1.00 


0.79 


0.94 


1.00 


1.00 


1.00 




10 


0.26 


0.29 


0.78 


1.00 


1.00 


0.56 


0.87 


1.00 


1.00 


1.00 




50 


0.00 


0.00 


0.03 


1.00 


1.00 


0.00 


0.03 


1.00 


1.00 


1.00 




100 


0.00 


0.00 


0.00 


1.00 


1.00 


0.00 


0.00 


1.00 


1.00 


1.00 








CA = 


2 under Ma+i 






CA = 


5 under Ma+i 




FB 


2 


0.85 


0.99 


1.00 


1.00 


1.00 


0.98 


1.00 


1.00 


1.00 


1.00 




5 


0.87 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 




10 


0.89 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 




50 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 




100 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 


1.00 



BIC 2 0.97 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 

5 0.97 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 

10 0.88 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 1.00 

50 0.02 1.00 1.00 1.00 1.00 0.89 1.00 1.00 1.00 1.00 

100 0.00 0.92 1.00 1.00 1.00 0.18 1.00 1.00 1.00 1.00 
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where t/ is a n x s orthogonal matrix, 9 £ TV and 

' p — 1 under M.a+i 

g — 1 under Mb+i 

p + q-2 under Ma+b+1 

^pq-l under Al(A+i)(B+i). 

When for 7 = A + 1, B + 1, A + B + 1, and {A + \){B + 1) and Mi 
are pairwisely compared, the corresponding Bayes factors can be derived in the 
same way as in Section [2j 

BYMMaM.] - p(i/2)r({^,.-l}/2) - Wr) 

mMMs;Mi^ - r(./2)r(,(p.- i)/2) H^,^-(--v-/. 



r(l/2)r({TOr-l}/2) V Wt 



r(i)r(H2^) V Wt 

BF f>/ . . ■ . r(pg/2)r(pg(r- l)/2) /T^^X 

BFpb[A^(a+i)(b+i),Mi] - r(l/2)r({Mr-l}/2) (vw^j 

In these expressions, Wt, Wa, Wb,Wab, and are sums of squares for bal- 
anced two-way ANOVA which identically satisfy 

Wt = Wa + Wb + Wab + We 
and each sums of squares are defined as follows. 

Wt = ^fe.fe - y-?, Wa = J2{y^.. - y...)\ 

ij k ij k 

Wb = ^{y-3- - y-?.WE = Y.(y^3k - mr? 

ij k ij k 

Wab = ^{mj- - Vi - - y-j- + 

ijk 

where 

y--- = — yijk , yi- = —y2 y^j^ > ~~ XI v^j^ ' = ~ XI • 

pqr ^-^ qr ^-^ pr ^-^ r ^-^ 

ijk ^ jk ^ ik k 

As our competitor, the corresponding Bayes factor based on BIG is considered 
as follows. 



BFbic[Ma;Mi] = {pqr)-(P-^y^ ^ ^ ' Wt 
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-pqr /2 



Wt 

-pqr 1 2 



( We 



Now we give a result for model selection consistency of two-way ANOVA. The 
proof is omitted since it is straightforward in the very similar way as the proof 
of Theorem 13.11 

Theorem 4.1. 1. Assume r — cxd and p and q are fixed. 

(a) BFbic o,nd BFpB are consistent whichever the true model is. 
2. Assume p — ^ oo, q oo and r is fixed. Also let 

hm ^ = OA, hm ^ = cb, hm ^ — = cab- 

p^oo pcr^ g^oo qa'^ p.q^oo pqa 

(a) BFbic and BFpB are consistent whichever the true model is among 

Ml, Ma+1, Mb+1, andMA+B+i- 
(h) BFpB is consistent under A4(A+i){B+i) when 

r^/ir-r) <i + CA + CB + CAB <{1 + cabYIt. (4.2) 

(c) BFpB is inconsistent wnder A^(^^x)(s+i) when (j4.2p is not satisfied. 

(d) BFbic is inconsistent wnder A^(^^x)(s+i) • 
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